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Abstract 

Characteristic models are an alternative, model based, representation for Horn expres- 
sions. It has been shown that these two representations are incomparable and each has 
its advantages over the other. It is therefore natural to ask what is the cost of translat- 
ing, back and forth, between these representations. Interestingly, the same translation 
questions arise in database theory, where it has applications to the design of relational 
databases. This paper studies the computational complexity of these problems. 

Our main result is that the two translation problems are equivalent under polyno- 
mial reductions, and that they are equivalent to the corresponding decision problem. 
Namely, translating is equivalent to deciding whether a given set of models is the set of 
characteristic models for a given Horn expression. 

We also relate these problems to the hypergraph transversal problem, a well known 
problem which is related to other applications in AI and for which no polynomial time 
algorithm is known. It is shown that in general our translation problems are at least as 
hard as the hypergraph transversal problem, and in a special case they are equivalent 
to it. 



1. Introduction 

The traditional form of representing knowledge in AI is through logical formulas (McCarthy, 
1958; McCarthy & Hayes, 1969), where all the logical conclusions of a given formula are 
assumed to be accessible to an agent. Recently, an alternative way of capturing such 
information has been developed (Kautz, Kearns, & Selman, 1995; Khardon & Roth, 1994). 
Instead of using a logical formula, the knowledge representation is composed of a particular 
subset of its models, the set of characteristic models. This set retains all the information 
about the formula, and is useful for various reasoning tasks. In particular, using model 
evaluation with the set of characteristic models, one can deduce whether another formula, 
a query presented to an agent, is implied by the knowledge or not. While characteristic 
models exist for arbitrary propositional formulas, in this paper we limit our attention to 
logical formulas which are in Horn form and to their representation as characteristic models. 

The characteristic models of Horn formulas have been shown to be useful. There is a lin- 
ear time deduction algorithm using this set, and abduction can be performed in polynomial 
time, while using formulas it is NP-Hard (Kautz et al., 1995). Furthermore, an algorithm 
for default reasoning using characteristic models has been developed, for cases where for- 
mula based algorithms are not known (Khardon & Roth, 1995). Hence, the question arises, 
whether one can efficiently translate a Horn formula into its set of characteristic models 
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and then use this set for the reasoning task. We denote this translation problem by CCM 
(for Computing Characteristic Models). 

On the other hand, given a set of assignments, it might be desirable to find the underlying 
structure behind this set of models. This is the case when one is trying to learn the 
structure of the world using a set of examples. This problem has been studied before under 
the name Structure Identification (Dechter & Pearl, 1992; Kautz et al., 1995; Kavvadias, 
Papadimitriou, & Sideri, 1993). Technically, the problem seeks an efficient translation from 
a set of characteristic models into a Horn expression that explains it. We denote this 
translation problem by SID (for Structure Identification). 

Interestingly, the same constructs appear in the theory of relational databases. As 
shown in a companion paper (Khardon, Mannila, & Roth, 1995), there is a correspondence 
between Horn expressions and Functional Dependencies, and a correspondence between 
characteristic models and an Armstrong relation. The equivalent question of translating 
between functional dependencies and Armstrong relations has been studied before (Beeri, 
Dowd, Fagin, & Statman, 1984; Mannila & Raiha, 1986; Eiter & Gottlob, 1991; Gottlob 
& Libkin, 1990) and is relevant for the design of relational databases (Mannila & Raiha, 
1986). While this paper does not discuss the problems in the database domain, some of the 
results presented here can be alternatively derived from previous results in database theory 
using the above mentioned equivalence. (We identify those precisely, later on.) However, 
this paper makes these results more accessible without resorting to any results in database 
theory, and with simpler proofs. On the other hand some new results are presented, which 
resolve a question which was open both in AI and in the database domain. 

1.1 An Example 

Let us introduce the problems in question through an example. Suppose the world has 4 
attributes denoted a, 5, c, rf, each taking a value in {0,1} to denote whether it is "on" or 
"off, and our knowledge is given by the following constraints: 

W = (bc-^ d)(cd b)(bc a). 

Then I^ is a Horn expression and it is normally used to decide whether certain constrains 
are implied by it or not. For example W |= (cd a), and W ^ (bd a), where the symbol 
1= stands for implication. This is normally performed by deriving a proof for the constraint 
in question. If no such proof exists then implication does not hold. In our example we 
would notice that (cd 5), and therefore (cd —^bc—^a). As for (bd a), we would fail to 
find a proof and therefore conclude that it is not implied by W. This general approach is 
called theorem proving, and is efficient for Horn expressions (Dowling & Gallier, 1984). 

An alternative approach is to check the implication relation by model checking. Im- 
plication is defined as follows: I^ |= a if every model of W is also a model of a (where 
X G {0, 1}" is a model of an expression / if / is evaluated to "truth" on x). So to decide 
whether I^ |= a we can simply use all the models of W, and check, one by one, whether 
any of them does not satisfy a. In our example W has 11 models: 

models(W) = {0000,0001,0010,0100,0101,1000,1001,1010,1100,1101,1111} 

(where the assignments denote the values assigned to abed correspondingly), and we would 
have to test a on every one of them. Unfortunately, in general the number of models may 



350 



Horn Expressions and Characteristic Models 



be very large, exponential in the number of variables, and therefore this procedure will not 
be efficient. 

The question arises therefore, whether there is a small subset of models which still 
guarantees correct results when used with the model checking procedure. Such a subset is 
called the set of characteristic models of W and its existence has been proved (Kautz et al., 
1995; Khardon & Roth, 1994). In our example this set is: 

chariW) = {0010, 0101, 1001, 1010, 1100, 1101, 1111}, 

so it includes 7 out of the 11 models of W. Model checking with this set is guaranteed 
to produce correct results for any a which is a Horn expression, and using a slightly more 
complicated algorithm one can answer correctly for every a (Kautz et al., 1995). In our 
example, it is easy to check that (cd a) is evaluated to "truth" on all the assignments in 
char(W) and that (bd a) is falsified by 0101. 

The utility of these representations, Horn expressions and characteristic models, is not 
comparable. Each of these representations has its advantages over the other. First, the size 
of these representations is incomparable. There are short Horn expressions for which the 
set of characteristic models is of exponential size, and vice versa, there are also exponential 
size Horn expressions for which the set of characteristic models is small (Kautz et al., 
1995). The representations also differ in the services which they support. On one hand, 
Horn expressions are more comprehensible. On the other hand characteristic models are 
advantageous in that they allow for efficient algorithms for abduction and default reasoning. 
In this paper we are asking how hard it is to translate between these representations, so as 
to enjoy the benefits of both. 

1.2 Overview of the Paper 

In this paper we study the complexity of the translation problems CCM and SID. For 
these problems, the output may be exponentially larger than the input. Therefore, it is 
appropriate to ask whether there are algorithms which can perform the above tasks in time 
which is polynomial in both the input size and the output size. These are called output 
polynomial algorithms. 

Before starting our investigation we note that it has been shown (Kautz et al., 1995) 
that using the set of characteristic models one can answer abduction queries related to H 
in polynomial time, while given the formula H it is NP-Hard to perform abduction (Selman 
& Levesque, 1990). This however does not imply that computing the set of characteristic 
models is NP-Hard since the construction in the proof yields a Horn formula whose set of 
characteristic models is of exponential size. 

Our main result says that CCM and SID are equivalent to each other, and are also 
equivalent to the corresponding decision problem. The problem of Characteristic Models 
Identification (CMI), is the problem of deciding, given a Horn expression H and a set of 
models G, whether G = char(H). We show that CCM, SID, and CMI are equivalent under 
polynomial reductions. Namely, the translation problems are solvable in polynomial time 
if and only if the decision problem is solvable in polynomial time. These are new results 
which have immediate corollaries in the database domain. 

We then show a close relationship between these problems and the Hypergraph Transver- 
sal Problem (HTR). Given a hypergraph G a transversal of its edges is a set of nodes which 



351 



Khardon 



touches every edge in the graph. In the HTR problem one is given a hypergraph as an 
input, and is required to compute the set of minimal transversals of its edges. 

The HTR problem has a lot of equivalent manifestations which appear in various 
branches of computer science. Examples in AI include computing abductive diagnoses (Re- 
iter, 1987), enumerating prime implicants in ATMS (Reiter & De Kleer, 1987), and Horn 
approximations (Kavvadias et al., 1993) which are closely related to characteristic models. 
Other areas include database theory (Mannila & Raiha, 1986), Boolean complexity, and 
distributed systems (Eiter & Gottlob, 1991). A comprehensive study of these problems is 
presented by Eiter and Gottlob (1994). HTR is also equivalent to the problem of dual- 
ization of monotone Boolean expressions, which is the form in which we present it here. 
This problem, requires translation between the CNF and DNF representations of monotone 
functions. 

The complexity of the HTR problem has been studied before (Fredman & Khachiyan, 
1994; Eiter & Gottlob, 1994; Kavvadias et al., 1993) and is still an open question. On one 
hand a class of problems which are "HTR complete" has been defined and studied (Eiter 
& Gottlob, 1994). This class includes many problems from various application areas which 
are equivalent to HTR (under polynomial reductions). On the other hand the problem is 
probably not NP-Complete. Recently, Fredman and Khachiyan (1994) have presented a 
sub-exponential n'^i^^sn-) time algorithm for the HTR problem. 

We first show that the problem CCM is at least as hard as HTR. By that we mean that 
if there is an output polynomial algorithm for CCM then there is an output polynomial 
algorithm for HTR. This has been stated as an open problem by Kavvadias et. al. (1993), 
who proved a similar hardness result for SID. Both hardness results can be alternatively 
derived by combining previous results in database theory (Eiter & Gottlob, 1994; Bioch & 
Ibaraki, 1993) and its relation to our problems (Khardon et al., 1995). 

We then consider two relaxations of these translation problems. The first is considering 
redundant Horn expressions which contain all the Horn prime implicates for a given ex- 
pression. The output of SID is therefore altered to be the set of all prime implicates, and 
similarly the input of CCM includes all the prime implicates instead of a minimal subset. 
It is shown that in this special case, SID, CCM, and HTR are equivalent under polynomial 
reductions. Therefore, the algorithm presented by Fredman and Khachiyan (1994) can be 
used to solve CCM, and SID in time This result can be alternatively derived from 

the results on functional dependencies in MAK form (Eiter & Gottlob, 1991). We show 
however that our argument generalizes to the larger family of A;-quasi Horn expressions. 

The second relaxation is the problem of computing all the prime implicants for a given 
Horn expression. This is a relaxation of CCM since using the prime implicants one can 
compute the characteristic models. Interestingly, the algorithm for HTR (Fredman & 
Khachiyan, 1994) can be adapted to this problem, resulting an algorithm with time com- 
plexity n'^(los''^). 

It is shown, however, that both relaxations do not help in solving the general cases of 
CCM and SID due to exponential gaps in the size of the corresponding representations. 

Lastly, we consider a related problem, denoted EOC, which is a minor modification of 
CCM and SID. This problem is shown to be co-NP-Complete. This serves to highlight some 
of the difficulty in finding the exact complexity of our problems. A variant of this result, 
has already appeared in the database literature (Gottlob & Libkin, 1990). 
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Figure 1: Summary of Complexity Results 



Our results are summarized in Figure 1, where a hierarchy of problems is depicted. 
The problem EOC is co-NP-Complete. The problem CMI is a special case of EOC, and is 
equivalent to SID and CCM. The problem HTR is a special case of CMI and is equivalent 
to SID and CCM under the restriction that the Horn expression is represented by the set 
of all prime implicates. 

The rest of the paper is organized as follows. Section 2 defines characteristic models, 
describes some of their properties, and formally defines the problems in question. Section 3 
discusses the relation between CCM, SID and the corresponding decision problem. Section 4 
discusses the relation to the HTR problem. We first establish the hardness result, and then 
consider the two relaxations mentioned above. Section 5 shows that EOC is co-NP-Hard, 
and Section 6 concludes with a summary. 

2. Preliminaries 

This section includes the basic definitions, and introduces several previous results which are 
used in the paper. 

We consider Boolean functions / : {0, 1}" {0, 1}. The elements in the set {xi, . . . , Xn} 
are called variables. Assignments in {0,1}" are denoted by x,y,z, and weight(x) denotes 
the number of 1 bits in the assignment x. A literal is either a variable Xi (called a positive 
literal) or its negation afj (a negative literal). A clause is a disjunction of literals, and a CNF 
formula is a conjunction of clauses. For example (xi Mx^) A (x^ V aTf V X4) is a CNF formula 
with two clauses. A term is a conjunction of literals, and a DNF formula is a disjunction 
of terms. For example (xi A X2) V (x^ A aTf A X4) is a DNF formula with two terms. A CNF 
formula is Horn if every clause in it has at most one positive literal. A formula is monotone 
if all the literals that appear in it are positive. The size of CNF and DNF representations 
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is, respectively, the number of clauses and the number of terms in the representation. We 
denote by \DN F(f )\ the size of the smallest DNF representation for /. 

An assignment x G {0, 1}" satisfies / if f(x) = 1. Such an assignment x is also called 
a model of /. By "/ implies , denoted / |= g, we mean that every model of / is also 
a model of g. Throughout the paper, when no confusion can arise, we identify a Boolean 
function / with the set of its models, namely Observe that the connective "implies" 

(1=) used between Boolean functions is equivalent to the connective "subset or equal" (C) 
used for subsets of {0, 1}". That is, / |= ^ if and only if / C ^. 

A term t is an impUcant of a function /, if t \= f. A term i is a prime impUcant of a 
function /, if t is an implicant of / and the conjunction of any proper subset of the literals 
in t is not an implicant of /. 

A clause d is an implicate of a function /, if f \= d. A clause c? is a prime implicate of a 
function /, if d is an implicate of / and the disjunction of any proper subset of the literals 
in d is not an implicate of /. 

It is well known that, a minimal DNF representation of / is a disjunction of some of its 
prime implicants. A minimal CNF representation of / is a conjunction of some of its prime 
implicates. 

If / is monotone, then it has a unique minimal DNF representation (using all the prime 
implicants), and a unique minimal CNF representation (using all its prime implicates). 

2.1 Characteristic Models 

The idea of using characteristic models as a knowledge representation was introduced by 
Kautz et. al. (1995). Characteristic models were studied in AI (Dechter & Pearl, 1992; 
Kavvadias et al., 1993; Khardon & Roth, 1994) and under a different manifestation in 
database theory (Beeri et al., 1984; Mannila & Raiha, 1986; Gottlob & Libkin, 1990; Eiter 
& Gottlob, 1991, 1994). This section defines characteristic models and their basic properties. 

For u,v £ {0, 1}", we define the intersection of u and v to be the assignment z G {0, 1}" 
such that Zi = 1 if and only if Ui = 1 and Vi = 1 (i.e., the bitwise logical-and of u and v.). 

For a set of assignments S , x = intersect(S ) is the assignment we get by intersecting 
all the assignments in S . We say that S is redundant if there exists x £ S and S' C S such 
that X ^ S' and x = inter sect{S'). Otherwise S is non-redundant. 

The closure of 5 C {0, 1}", denoted closure(S ), is defined as the smallest set containing 
S that is closed under intersection. 

To illustrate these definitions consider the set M = {1101,1110,0101}. Then M is 
non-redundant, intersect(M) = 0100, and closure(M) = {1101,1110,0101,0100,1100}. 

Let H he a Horn expression. The set of the Horn characteristic models of H , denoted 
here char(H) is defined as the set of models of H that are not the intersection of other 
models of H . Note that char(H) is non-redundant. Formally, 

char(H) = {u e H \ u ^ closure(H \ {u}) }. (1) 

For example, char{{1101, 1110, 0101, 0100}) = {1101, 1110, 0101}. 

It is well known that the set of models of Horn expressions is closed under intersection. 
This result is due to McKinsey (1943), who proved it for a certain class of first order sen- 
tences. Alfred Horn (1951) considered a more general class of sentences. (Lemma 7 by Horn 
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(1951) deals with the propositional case. Dechter and Pearl (1992) present another proof for 
the propositional case.) Moreover, since characteristic models capture all the information 
about the closure, they also capture all the information about the Horn expression. 

Theorem 1 (Kautz et al., 1995; Dechter & Pearl, 1992) Let H be a Horn expression 
then H = closure(char(H )) . 

2.2 Monotone Theory and Characteristic Models 

The monotone theory was introduced by Bshouty (1993), and was later used for a theory 
for model-based reasoning (Khardon & Roth, 1994). This section explores the relations 
between the monotone theory and characteristic models. 

Definition 1 (Order) We denote by < the usual partial order on the lattice {0, 1}", the 
one induced by the order < 1. That is, for x,y £ {0, 1}", x < y if and only if^fi, Xi < yi. 
For an assignment b G {0, 1}" we define x <b y if and only if x Q) b < y ®b (Here ® is the 
bitwise addition modulo 2). We say that x > y if and only if x > y and x y. 

Intuitively, if bi = then the order relation on the ith bit is the normal order; if bi = 1, 
the order relation is reversed and we have that 1 0. For example 0101 <iiii 0100, and 
0101 0110. We now define: 

The monotone extension of z £ {0, 1}" with respect to b: 

Mb{z) = {x \ X >b z}. 
The monotone extension of f with respect to b: 

JUbif) = {x \ X >b z, for some z G /}. 
The set of minimal assignments of f with respect to b: 

min6(/) = {z \ z e f, such that My e f,z v}- 

For example 

>liiii(0101) = {0101,0001,0100,0000}, and 

>liiii(1100) = {1100,0100,1000,0000}. 

Let / = bc(a W d)(a V d), then in the set notation / = {1100,0101}, and Aliiii(/) = 
{0101, 0001, 0100, 0000, 1100, 1000}. The set miniiii(/) = {1100, 0101}, and the set 
minoooi(/) = {0101}. 

Clearly, for every assignment b G {0, 1}", / C JUbif )- Moreover, if 5 ^ /, then b ^ 
■Mbif ) (since b is the smallest assignment with respect to the order <&). Therefore: 

/= A Mb{f)= /\Mb{f). 

6e{0,l}" 6^/ 

The question is if we can find a small set of negative examples, and use it to represent / as 
above. 
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Definition 2 (Basis) A set B is a basis for / if f = Aibi f)- B is a basis for a class 

of functions T if it is a basis for all the functions in T . 

Using this definition, we get an alternative representation for functions 

f=^MAj)=t\ V -^^(^)- (2) 
6eB 6eB^gniin5(/) 

It is known that the set B^ = {m G {0, 1}" | weight(M) > n — 1}, is a basis for any Horn 
CNF function. For example consider the Horn expression W = (be d)(cd b)(bc a) 
discussed in the introduction. Recall that the satisfying assignments of W are: 

modelsiW) = {0000,0001,0010,0100,0101,1000,1001,1010,1100,1101,1111}. 

We have to compute the sets miVLbiW) for b G Bh, where Bh = {HH, IHO, 1101, 1011, 0111} 
Note that if b satisfies / then min5(/) = {5}, and ^Abif ) = 1 (that is, \fx, ^Abif )ix) = 1). 
Therefore, miniiii(TU) = {1111}, and miniioi(TU) = {1101}. One way to compute the sets 
of minimal assignments is by drawing the corresponding lattices and noting the relations 
there. Figure 2 shows the lattice with respect to 5 = 0111. The satisfying assignments 
of W are marked in bold face. The minimal assignments are underlined, and some of the 
order relations, which show that the rest of the assignments are not minimal, are drawn. To 
compute ^AbiW) we have to add any assignment which is above the minimal assignments. 
This is marked by the dotted lines which show that 1011 and 1110 are in J^oiii(W). 
Using the figure we observe that minoiii(TU) = {1111,0101,0010}. The other sets are 
miniiio(TU) = {1111, 1100, 1010}, and minioii(TU) = {1111, 1001, 1010}. 

It is known that the size of the basis for a function / is bounded by the size of its CNF 
representation, and that for every b the size of min5(/) is bounded by the size of its DNF 
representation. 

For any function / and set of assignments B let: 

= mmB(f ) = ^beB{z G min6(/)}. 
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The following theorem gives an alternative way to define char(H). 

Theorem 2 (Khardon & Roth, 1994) Let H be a Horn expression. Then char(H) = 
■ 

Continuing the above example with the function W = (be d)(cd b)(bc a), 
we conclude that char(W) = {0010,0101,1001,1010,1100,1101,1111}. As the following 
theorem shows the set of characteristic models can be used to answer deduction queries. 

Theorem 3 (Kautz et al., 1995; Khardon & Roth, 1994) Let Hi, H2 be Horn 
expressions then Hi |= H2 if and only if for all x G char(Hi), H2(x) = 1. 

It is useful to have the DNF representation of a function. If / is given in its DNF 
representation then it is easy to compute the set minb(f ), for any b. Each term in the 
DNF representation can contribute at most one assignment, mmb(t), where the variables 
that appear in the term are fixed and the others are set to their minimal value. This is true 
since from every other satisfying assignment of the term we can "walk down the lattice" 
towards this assignment, on a path composed of satisfying assignments. For example, the 
minimal assignment for the term t = X1X3, with respect to the basis element b = 0011, 
is minooii(0 = {1001}. The assignment 1100 which also satisfies t is not minimal since 
1001 <ooii 1101 <ooii 1100. Further, once we have one assignment from each term, it 
is easy make sure that the set is non-redundant by checking which of the assignments 
generated is in the intersection of the others. We would use this algorithm later in some of 
our reductions. 

We say that a function is 5-monotone if it is monotone according to the order relation 
<5. Namely, if whenever f(x) = 1 and y >b x then f(y) = 1. Notice that if we rename 
the variable Xi by its negation, for each i such that bi = 1 (i.e. where the order relation 
is reversed), then / becomes monotone. Therefore, 5-monotone functions enjoy similar 
properties. For example, they have unique minimal DNF and CNF representations. Another 
property is that the minimal assignment which corresponds to every term is indeed part of 
the set min5(/). 

Claim 1 (Khardon & Roth, 1994) For any b-monotone function f, there is a 1-1 cor- 
respondence between the prime implicants of f and the set minb( f ). Namely: 

(1) for every term t in the minimal DNF representation for f, the assignment minb(t) is in 
minbif). 

(2) \minb{f)\ = \DNFif)\. 

We would also use the notion of a least upper bound of a Boolean function (Selman & 
Kautz, 1991), which can sometimes be characterized by the monotone theory. 

Definition 3 (Least Upper-bound) Let T , Q be classes of Boolean functions. Given 
f £ J- we say that g £ Q is a Q -least upper bound of f if and only if f C g and there is no 
f'eQ such that f C f C g. 
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Theorem 4 (Khardon & Roth, 1994) Let f be any Boolean function and Q a class of 
all Boolean functions with basis B. Then, fj^^ defined as 

£b = A -Mbif) 

beB 

is the Q-least upper bound of f. 

For the class of Horn expressions we have two ways to express the least upper bound. 
One using the monotone theory, and one using the closure operator: 

Theorem 5 (Dechter & Pearl, 1992; Kautz et al., 1995; Khardon & Roth, 1994) 

Let f : {0, 1}" {0, 1} be a Boolean function. Then fj^f = closure( f ), and char{ fj^f ) C 
/. 

For example consider the function / = {be d)(cd b)(bc a)(a W b W cW d). The 
function / satisfies all the assignments as W above except for 0001. However, 
inter secti {0101, 1001}) = 0001, and therefore /^^f = W. 

2.3 The Computational Problems 

This section includes definitions for all the problems discussed in this paper. Let H he a 
CNF expression in Horn form, and let char(H) be its set of characteristic models. The 
translation problems considered are: 

CCM: Computing Characteristic Models 
Input: a Horn CNF H. 
Output: the set char(H). 

SID: Structure Identification (Computing Horn Expressions) 

Input: a set of assignments F. 

Output: a Horn CNF H, such that F = char(H). 

HTR: Hypergraph Transversals (Dualization of Monotone Expressions) 
Input: a monotone CNF expression C . 

Output: a monotone DNF expression D, such that C = D. 

The decision problems discussed: 

CMI: Characteristic Models Identification 

Input: a Horn CNF H , and a set G of satisfying assignments of H . 
Output: Yes iff char(H) C G. 

Note: The condition is equivalent to H |= closure(G), and essentially also to G = char(H). 

EOC: Entailment of Closure 

Input: a Horn CNF H , a set G of assignments. 

Output: Yes if and only if H |= closure(G). 

We also discuss the following variant of CMI: 

CMIC: Characteristic Models Identification with Counter example 
Input: a Horn CNF i7, a set G of satisfying assignments of H . 

Output: If Char(H) C G then output Yes. Otherwise, output No and supply a counter 
example x G Char(H) \ G. 



358 



Horn Expressions and Characteristic Models 



2.4 Polynomial Time Algorithms and Reductions 

As mentioned above we need to define algorithms that are polynomial with respect to their 
output. There is more than one way to give such a definition. (A discussion of this issue is 
given by Eiter and Gottlob (1994).) We use the weakest^ of those which is called an output 
polynomial algorithm. 

When the output of a problem P is uniquely defined, we say that an algorithm A is an 
output polynomial algorithm for P if it solves P correctly in time which is polynomial in 
the size of its input and output. This is the case with HTR, and CCM. 

When the output of a problem P is not uniquely defined, we consider the shortest 
permissible output 0(1) for input /. We say that an algorithm A is an output polynomial 
algorithm for P if it solves P correctly in time which is polynomial in the size of its input 
/ and the size of 0(1). We note that for SID the output is not uniquely defined since there 
is no unique minimal representation for Horn functions. 

We define polynomial reductions with respect to an oracle (i.e. we use Turing reducibility 
(Garey & Johnson, 1979)). A problem PI is polynomially reducible to a problem P2 if there 
is an output polynomial algorithm that solves PI when given access to (1) an output 
polynomial subroutine for P2, and (2) a polynomial bound^ on the running time of the 
subroutine. 

3. Translating is Equivalent to Deciding 

In this section we show that the problems CCM, SID, CMI, and CMIC are equivalent under 
polynomial reductions. Namely, both translation problems are solvable in polynomial time 
if and only if the corresponding decision problem CMI is solvable in polynomial time. 

Theorem 6 The problems CCM, SID, CMI, and CMIC are equivalent under polynomial re- 
ductions. 

Proof: The proof is established in a series of lemmas. In particular we show that CMIC < 
CMI < SID < CMIC, and that CMI < CCM < CMIC, where < denotes "is polynomially 
reducible to", in Lemma 1, Lemma 2, Lemma 3, Lemma 4, and Lemma 5 respectively. ■ 

Lemma 1 The problem CMIC is polynomially reducible to the problem CMI. 

Before presenting the proof consider how a similar result is achieved for the satisfiability 
problem (Garey & Johnson, 1979). Namely, how a decision procedure for satisfiability can 
be used to construct an algorithm that finds a satisfying assignment if one exists. Suppose 

1. Other related notions which we do not use here are "enumeration with polynomial delay" and "enumer- 
ation with incremental polynomial delay" (Eiter & Gottlob, 1994). These require that the algorithm 
will compute the elements of its output one at a time, and restrict the time delay between consecutive 
outputs. Incremental polynomial delay allows the delay to depend on the problem size and on the num- 
ber of elements computed so far. Polynomial delay is stricter in that it requires dependence only on the 
problem size. Both of these notions are stricter than output polynomial algorithms since the latter may 
wait a long time before computing its first output. Unfortunately, most of our reductions yield output 
polynomial algorithms, and we cannot guarantee that the stronger notions hold. 

2. That is, a polynomial in the dimension of the problem (the number of variables), the input size, and the 
output size. 
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we have a formula C, and that we know that it is satisfiable. (We used the decision procedure 
to find that out.) Our task is to find a satisfying assignment for it. What we do is substitute 
xi = into C yielding a formula C° with n — 1 variables. The formula C° is satisfiable if 
and only if C has a satisfying assignment in which Xi = 0. We run the decision procedure on 
C°. If the answer is Yes then we know that C has a satisfying assignment in which Xi = 0. 
If the answer is No then since C is satisfiable, it must have a satisfying assignment in which 
xi = 1. In either case we found a substitution for xi which guarantees the existence of a 
satisfying assignment. All we have to do is to recurse with this procedure on C°. 

An example can clarify this a bit more. Suppose we have the expression C = (aVc)(5Vc) 
which is satisfiable. To find a satisfying assignment we substitute a = to get C° = (c)(5Vc), 
and run the decision procedure on C°. The answer is Yes, and therefore we continue with 
C°. We next substitute 5 = to get C°° = cc. We run the decision procedure again, and 
the answer is No. Therefore we conclude that we must substitute 5=1 instead of 5 = 0. 
This yields = c. We then continue to find that c must be assigned 1 and altogether we 
find the satisfying assignment abc = 011. 

We would like to use the same trick here. However, G is given as a set of models and 
we cannot perform this substitution procedure as easily"^. Nevertheless, as the proof shows 
something similar can be done. 

Proof: First observe that we have a solver for CMI. Therefore if the answer is Yes we have 
no problem, we can simply answer Yes. A problem arises in the case where the answer is 
No. In this case CMI is happy with saying No, but CMIC must provide a counter example. 

Formally, we get H, G as input to CMIC and an algorithm A to solve CMI. We run A 
on H, G as an input, and if A replies Yes we reply Yes. Otherwise we know that there exits 
an a; G char(H) \ G. We need to find such a model and return it as the output of CMIC. 

Consider first the easier task of finding x G H\closure(G); the assignment a; is a witness 
for the fact H ^ closure(G). 

Recall the substitution trick from above, and observe that for Xi = 1 a similar substitu- 
tion works. For H we simply perform the substitution to get an expression H , and for G 
we remove any z £ G m which Zi = to get the set G. We claim that there is a witness for 
H, G with Xi = 1 if and only if there is a witness for H , G. This follows from the fact that 
X G closure(G) and Xi = 1 if and only if a; G closure(G). To see that, let x G closure(G), 
such that Xi = 1; if a; = intersect(S ), and y £ S then yi = 1, and therefore x G closure(G). 
Also if a; G closure(G) then x G closure(G). Therefore, if there is a witness x with Xi = 1 
then we can detect this fact by presenting A with H,G as input (on which it will say No). 

This however does not work for Xi = 0. In this case an element in the closure requires 
at least one element in S with yi = 0, but we have no information on the other elements. 
Therefore we can not perform the recursion in the case where substitution of Xi = is 
required. 

We circumvent this problem using the following iterative procedure. In each stage we 
try to turn one more variable to 1. For all i, we make the experiment described above of 
substituting Xi = 1. If the answer is No, for some i, we can proceed to the next stage, just 
as before (ignoring tests for other values of i). If the answer is Yes for all i, then we know 

3. Furthermore, the closed form one can derive using the monotone theory (Khardon & Roth, 1994) does 
not seem to be useful. 
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that for each Xi that did not receive a value so far, there is no witness with Xi = 1, so 
the only possible witness is the one assigning to all the variables. We return the witness 
X G {0, 1}" arrived at, by the above substitutions, as the counter example of CMIC. 

From the construction it is clear that x £ H \ closure(G), but the requirement of CMIC 
is that X G char(H) \ G. We claim that this stronger condition holds. Suppose not, and 
let S C char(H) be such that x = intersect(S ). Then clearly S is not a subset of G or 
otherwise x G closure(G). Let y £ S \ G, then since x = intersect(S ), we get x <o" y. 
Namely, if Xi = 1 then yi = 1. But this is a contradiction, since in the last run of the 
algorithm A for CMI, it was concluded that no more variables could be set to 1, while still 
maintaining a witness. ■ 

We exemplify the proof using the function W = (be d)(cd b)(bc a) presented 
in the introduction. Recall that char(W) = {0010,0101,1001,1010,1100,1101,1111}, and 
suppose that so far we found G = {0010, 1001, 1010, 1100, 1101, 1111}. That is, all but the 
model 0101. We run CMI on W, G and, since G does not include all the characteristic mod- 
els, it answers No. In order to find the counter example we make 4 separate substitutions, 
one for each variable substituted to 1. 

Consider the substitution with 5=1. This yields W = (c ^ d)(c a), and G = 
{IsOO, IsOl, Isll}, where we use s to mark that the variable b was substituted. We run 
CMI on W,G and it finds out that there is a counter example (the assignment OsOl is in 
W but not in closure(G)), and therefore it answers No. That means we can continue our 
algorithm with 5=1. We forget all the information from the other substitutions (that were 
not considered in detail) and continue to the next step. 

In the next step we substitute 1 to each of a, c, d. Consider first the substitution for a. 
This yields W = (c ^ d) and G = {ssOO, ssOl, ssll}. Running CMI on this pair we get 
the answer Yes. Namely W = closure(G). Consider now the substitution for d. This yields 
W = (c ^ a) and G = {IsOs, Isls}. Running CMI on this pair we get the answer No (since 
OsOs is a counter example). We can therefore recurse on this value. 

In the next iteration both substitutions for a and for c, yield the answer Yes, and 
therefore we substitute to both to get the final counter example abed = 0101. 

Using this example it is easy to see that one can improve the running time of the 
reduction by simply remembering the attributes for which we received the answer Yes. 
These attributes will have to get the value in the end. In this way we can scan the 
variables one by one, and recurse on the first that yields the answer No. This requires only 
n calls to CMI. 

Lemma 2 The problem CMI is polynomially redueible to the problem SID. 

Proof: We are given an output polynomial time algorithm A for SID, and a polynomial 
bound on its running time (that is, a polynomial in the number of variables ra, the input 
size, and the output size). Given II,G as input to CMI, we run A on G until it stops and 
outputs H' or until it exceeds its time bound (with respect to the size of H). In the first 
case we check whether H = H' (which can be done in polynomial time (Dowling & Gallier, 
1984)) and answer accordingly. In the second case we know that the real Horn expression 
which corresponds to G is larger than H and therefore we answer No. ■ 
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The proof of the next lemma draws on previous results in computational learning theory. 
In this framework a function / : {0, 1}" {0, 1} is hidden from a learner that has to 
reproduce it by accessing certain "oracles". A membership query allows the learner to ask 
for the value of the function on a certain point. 

Definition 4 A membership query oracle for a function f : {0, 1}" {0, 1}, denoted 
MQ(f ), is an oracle that when presented with x G {0, 1}" returns f{x). 

An equivalence query allows the learner to find out whether a hypothesis he has is 
equivalent to / or not. In case it is not equivalent, the learner is supplied with a counter 
example. 

Definition 5 An equivalence query oracle for a function f : {0, 1}" {0, 1}, denoted 
EQ{ f ), is an oracle that when presented with a hypothesis h : {0, 1}" {0, 1}, returns Yes 
if f = h. Otherwise it returns No and a counter example x such that f(x) h(x). 

We use a result that has been obtained in this framework. 

Theorem 7 (Angluin, Frazier, & Pitt, 1992) There is an algorithm A, that when given 
access to MQ( f ) and EQ{ f ), where f is a hidden Horn expression, runs in time polynomial 
in the number of variables and in the size of f, and outputs a Horn expression H which is 
equivalent to f. 

The hypothesis h, in the algorithm's accesses to EQ{f ), is always a Horn expression. 

The following lemma, and the simulation in its proof, are implicit in previous works (Dechter 
& Pearl, 1992; Kautz et al., 1995; Kivinen & Mannila, 1994). 

Lemma 3 The problem SID is polynomially reducible to the problem CMIC. 

Proof: We are given G as input to SID, and a polynomial time algorithm C for CMIC. 
Our algorithm will run the algorithm A from Theorem 7 and answer the MQ and EQ 
queries that A presents. 

Given x G {0, 1}" for MQ the algorithm tests whether x G closure(G). This can be 
done by testing whether x is equal to the intersection of all elements y m G such that y > x. 

Given a Horn expression h for EQ (the theorem guarantees that the hypothesis is a Horn 
expression), we have to test whether h = closure(G). We first test whether closure(G) C h, 
which is equivalent to closure(G) |= h. Theorem 5 together with Theorem 3 imply that 
if the answer is No, then for some x £ G, h(x) = 0. Such an a; is a counter example for 
the equivalence query, and the test can be performed simply by evaluating h on all the 
assignments in G. 

If closure(G) |= h, namely all the assignments in G satisfy h, we present h, G as input 
to the algorithm C for the problem CMIC. The input to CMIC is legal. C may answer Yes, 
meaning char(h) C G, which implies h C closure(G). In this case we answer Yes to the 
equivalence query. Otherwise C says No and supplies a counter example x G char(h) \ G. 
Since G C h we get x G h\closure(G) and therefore we can pass a; on as a counter example 
to the equivalence query. ■ 
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We next consider the problem CCM: 

Lemma 4 The problem CMI is polynomially reducible to the problem CCM. 

Proof: We are given an output polynomial algorithm C for CCM, and a polynomial bound 
on its running time (that is, a polynomial in the number of variables ra, the input size, and 
the output size). Given H,G as input to CMI, we run C on H until it stops and outputs 
G' or until it exceeds its time bound (with respect to the size of G). In the first case we 
compare G and G' and answer accordingly. In the second case we know that the set of 
characteristic models of H is larger than G and therefore we answer No. ■ 

Lemma 5 The problem CCM is polynomially reducible to the problem CMIC 

Proof: Given H as input for CCM, an algorithm for CMIC can be used repeatedly to 
produce the elements of char(H). 

We start with G = 0. In each iteration we run CMIC on H, G to get a new characteristic 
model which we add to G. Once we find all the characteristic models CMIC will answer Yes. 
(In fact, if CMIC is polynomial in its input size then we get an "incremental polynomial 
algorithm" (Eiter & Gottlob, 1994) which is even stronger than "output polynomial" as 
required here.) ■ 

4. The Relation to Hypergraph Transversals 

In this section we establish the relation to the hypergraph transversal problem. We first 
show that our problems are at least as hard as HTR. We then consider two relaxations of 
SID and CCM. The first relaxation considers redundant representation for Horn expressions, 
which includes all the prime implicates. The second relaxation considers computing prime 
implicants instead of characteristic models. Both of these relaxations enjoy sub-exponential 
algorithms. It is shown, however, that the relaxations do not help in the general case, as a 
result of exponential gap in the size of the corresponding representations. 

4.1 The Reduction to HTR 

The problem HTR is defined as computing a DNF representation for a monotone function 
given in its CNF form. It is easy to observe that this is equivalent to computing a CNF 
representation for a monotone function given in its DNF form. (We can simply exchange 
the V and A operations to get one problem from the other). We can therefore assume 
that the input for HTR is given as either a DNF or a CNF. Another useful observation is 
that renaming the variables does not change the problem. Therefore if we rename every 
variable as its negation (namely, replace every Xi with afj), we get the equivalent problem of 
translating between functions which are monotone with respect to the order relation <i". 
We call such functions anti-monotone. This is useful since anti-monotone functions have 
CNF representations in which all variables are negated, which is a special case of Horn 
expressions. Having these observations, the next two theorems follow almost immediately 
from the definitions, given the correspondence between minimal elements and prime impli- 
cants described in Claim 1. The following result has been stated as an open problem by 
Kavvadias et. al. (1993). 



363 



Khardon 



Theorem 8 The problem HTR is polynomially reducible to the problem CCM. 

Proof: Let A be an algorithm for the problem CCM. We construct an algorithm B for 
the problem HTR. We may assume that the input is an anti-monotone CNF, C, and we 
want to compute its anti-monotone DNF representation. 

The basic idea is that using Claim 1 we know how to compute the DNF from minin(C), 
and that the latter is a subset of the characteristic models. So all we need to do is let A 
compute the characteristic models, identify the set minin(C), and compute the DNF. 

More formally, the algorithm B runs A to compute F = char(C) = minB^(C), and 
computes the set Fi^ ={z£T\\fy£T,z <fi\n y}. Namely the elements of F which are 
minimal with respect to the order relation 5=1". It then computes the anti-monotone 
DNF expression D = V^gPin ^z,=o 'xj, which it outputs. 

The correctness of the algorithm follows from Claim 1 which guarantees that the com- 
putation of the DNF from the set of characteristic models is correct. 

As for the time complexity we observe, using Claim 1, that F is not considerably larger 
than the size of the DNF. This is true since for all 5, \DN F(f )\ = |minin(/)| > |min5(/)|, 
and \Bh\ = n + 1. ■ 

To exemplify the above reduction, suppose that we have only three variables a, 5, c, and 
that the input is C = (a V 5)(5 Vc). (The satisfying assignments are 000,001,010,100,101, 
and the required DNF expression is a c V b.) The algorithm A will compute the set of 
characteristic models char(C) = {101,010,100,001}, from that we find that minin(C) = 
{101, 010}. The term which corresponds to 101 is 5, and the term which corresponds to 010 
is a c and indeed we get the right DNF expression. 

Using the monotone theory one can give a simple proof for the following theorem, which 
has already been proved by Kavvadias et. al. (1993). 

Theorem 9 (Kavvadias et al., 1993) The problem HTR is polynomially reducible to the 
problem SID. 

We note that both theorems can be deduced by combining results in database theory 
(Eiter & Gottlob, 1994, 1991; Bioch & Ibaraki, 1993) and using the above mentioned 
equivalence with problems in database theory (Khardon et al., 1995). 

4.2 Enumerating Prime Implicates 

Having obtained the hardness results in the previous sub-section, a natural question is 
whether CCM, and SID are as easy as HTR. This would help settle the exact complexity of 
the problems discussed, and more importantly would imply a sub-exponential algorithm for 
the problem. While no such reduction has been found, we show here that it holds in a special 
case. We show, however, that the solution obtained in this way may need exponential time 
in the general case. 

This result has already been obtained in the database domain (Eiter & Gottlob, 1991), 
where restrictions of functional dependencies to be in MAK form is discussed. Our argu- 
ment, however, can be generalized to richer languages, and in particular holds for the family 
of A;-quasi Horn expressions defined below. 
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In particular we relax the problems so as to use the largest Horn expression for a 
function instead of using a small Horn expression. In this case the problem SID amounts 
to computing all the (Horn) prime implicates of the function identified by T. For CCM we 
have to compute the set of characteristic models given the set of all prime implicates rather 
than a small expression. 

We would use the following example to illustrate the notions in this sub-section. Con- 
sider the function W = (a ^ b)(c b)(b V d). The satisfying assignments of W are 
W = {0000,0001,0100,0110,1100,1110}, and the characteristic models are char(W) = 
{0001,0110,1100,1110}. One can verify that W |= (cWd)(aWd), and that these are the 
only additional Horn prime implicates of W. 

For CCM, this section asks whether it is easier to compute the characteristic models 
starting with the equivalent expression W = (a ^ b)(c b)(bW d)(cW d)(aW d). For SID the 
question is whether it is easier to output the whole set rather than just a minimal subset. 
These are relaxations of the problems since, an algorithm for SID is allowed more time to 
compute its output, and CCM is given more information and more time for its computation. 

Let / be a Horn expression, then using the monotone theory representation (Equa- 
tion (2)) we know that 

/ = AbeB^Mbif). (3) 

Recall that Bh = {u £ {0, 1}" | weight(M) > ra — 1}, and denote by 1 < i < n, the 
assignment with Xi set to zero and all other bits set to 1, and by the assignment 1". In 
our example = 1111, and b^^^ = 0111. 

Let Di be the set of clauses that are falsified by and let Qi denote the language of all 
CNF expressions with clauses from Di. In our example, with four variables a, 5, c, d, clauses 
in Di may have b,c,d as negative literals and a as a positive literal. That is, (a V 5) ^ Di, 
but (a V 5) G Di and (5 Vc) G -Di. 

Theorem 4 implies that ^Af^(,)(f ) is equal to the least upper bound of / in Qi. Namely, 
the intersection of all clauses in Di which are implied by /. Define PI( f, i) to be the set of 
prime implicates of / with respect to b^^h Formally: 

P/(/, i) = {rf G A|/ 1= d and Md' C dj ^ d'}. 

Using this notation we get: 

M,iM= A d. (4) 
dePi(f,i) 

Going back to the example W, we have: 

PI(W,0) = (b\/d)(c\/d)(a\/d) 

PI(W,1) = (b\/d)(c\/d) 

PI(W,2) = (a b)(c-^ b)(cyd)(ayd) 

PI(W,3) = (b\/d)(a\/d) 

PI{W, 4) = true. 

Note that the partition of the prime implicates of / is not disjoint. In particular, the 
anti-monotone prime implicates (except for aTf V af2 V ... V Ic^ if it is a prime implicate) 
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appear in several PI(f,i) sets. Equation (3) tells us that we can decompose the function 
into n + 1, 5W -monotone functions. Equation (4) tells us how to decompose the clauses of 
the function, and the monotone theory tells us how to decompose the characteristic models. 
These observations lead to the following theorem: 

Theorem 10 The problem CCM, when the input is given as the set of all Horn prime 
implicates, is polynomially equivalent to HTR. 

Proof: First observe that the reduction in Theorem 8 uses an anti-monotone function, 
which has a unique Horn representation. Namely the smallest and the largest representa- 
tions are the same in this case. This implies that the problem remains as hard as HTR in 
this special case. 

For the other direction, we first partition the input into the sets P/(/, i), and then use 
a procedure for HTR in order to translate each set to a DNF representation. Then using 
Claim 1 we translate the DNF expression to the set of minimal assignments. The crucial 
point is that we have DNF representations for the functions ^Af^(,)(f) rather than for /. 
This implies that each term in these DNF representations is represented as an element in 
char(f ) and therefore the reduction is polynomial. (We may get some of the elements in 
char(f ) more than once, but at most n times, which is still polynomial.) ■ 

In our example, we get the following DNF expressions and their translation into assignments: 



PI( W, 0) 


= a b cW d ^ 


To 


= 0001,1110 


PI(W, 1) 


= bc\/d =^ 




= 0001,0110 


PI( W, 2) 


= bdW Tie =^ 


r2 


= 1110,0001 


PI( W, 3) 


= ab\/d =^ 




= 0001,1100 


PI( W, 4) 


= true =^ 


r4 


= 1110 



Similarly we get for SID: 

Theorem 11 The problem SID, when the output required is all Horn prime implicates, is 
polynomially equivalent to HTR. 

Proof: The proof is similar to the proof of the previous theorem. The hardness follows 
from Theorem 9. 

For the other direction, assume we get as input a set F, and an algorithm A for HTR. 
We first partition F into sets Fj- according to minimality with respect to b^''K (Note that the 
sets are not disjoint.) Then we use Claim 1 to transform each Fj- into a DNF expression for 
the function ^Af^(,)(f). For each such DNF expression we run the procedure A to compute 
its CNF representation. By Equation (3), the intersection, with respect to i, of these CNF 
expressions is the Horn expression we need. ■ 

In the example, we simply start with the sets Fj- and use the same equations as above 
going in the other direction. From the above two theorems we get the following corollary. 

Corollary 1 The problems CCM and SID, when the Horn expression is represented as 
the set of all Horn prime implicates, are polynomially equivalent, and are polynomially 
equivalent to HTR. 
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The equivalence of CCM and SID, in this special case, has been observed before in 
the database domain (Heikki Mannila, private communication). In fact this led us to the 
results of this section. As mentioned above a similar result for relational databases is 
reported by Eiter and Gottlob (1991) where the restriction is called the MAK form for 
functional dependencies. 

Lifting the Restriction: The polynomial equivalence to the problem HTR, implies the 
existence of sub-exponential n'^i^^sn-) algorithm for these problems which may have some 
practical implications. However, as the following example shows one cannot apply it to 
solve the general case of the problem SID. Aizenstein and Pitt (1995) present some functions 
with interesting properties. These functions can be manipulated to create examples with 
the following properties: (1) / has a short Horn expression, (2) \char(f )\ is small, (3) the 
number of Horn "prime implicates" is exponential. In particular 

/ = (^ V ^ V . . . V ^) A (si V yT) A (a;2 V yi") A . . . A (a;„ V J/;;:) 

has these properties. The set of prime implicates include all the disjunctions (5iV52V. . .V5m) 
where b, G {xj,yj}. 

We show by case analysis that the set of characteristic models is small. Observe that 
in order to satisfy /, at least one of the Xi variables must be assigned 0, and that if Xi = 
then yi must also be assigned 0. 

Consider first the set mini2m(/). Notice that if, for some j, Xj = yj = and all the 
other variables are set to 1, then / is satisfied. This contributes exactly m assignments 
to mini2m(/). For m = 3 and variable ordering xiX2X3yiy2y3, this yields the assignments 
011011, 101101, 110110. 

Consider next mmj^(x^)(f). Namely, the basis element in which Xi = 0. To satisfy /, 
if Xi = then yi must be 0, and as before we can set all other variables to 1. If Xi = 1 
then there must be another variable Xj which is set to 0. In this case yj must also be 0. 
Therefore niinjj(x,) (/) = min;^2m(/). 

Lastly, consider mmj^(y^)(f). Namely the basis element in which yi = 0. Observe that / 
is anti-monotone in yi. Namely, given any satisfying assignment with yi = 1, by hipping yi 
to we get another satisfying assignment, which is smaller than the original according to 
<^(j,,). Therefore, we may assume that yi = 0. If Xi = then we can set all other variables 
to 1. If Xi = 1 then there must be another variable Xj which is set to 0, and therefore also 
yj = 0. This assignment is 2 bits away from and it is minimal. We get m assignments 
in this case too. In our example with m = 3, and say i = 2, we get the assignments 101101, 
011001, and 110100. 

Altogether we get m assignments from the first two groups and m(m — 1) new assign- 
ments from the last and therefore \char(f )\ = w? . This means that arbitrary enumeration 
of the prime implicates, for a given set of models L, is not sufficient for solving SID. 

A Generalization: While we concentrate in this paper on Horn expressions, we note that 
the same arguments and proofs hold in the more general case of A;-quasi Horn expressions. 
These are expressions in CNF form where in every clause there are at most k positive 
literals (so that Horn expressions are 1-quasi Horn expressions). The set Bh,, = {m G 

{0, 1}" I weight(M) > ra — A;} is a basis for A;-quasi Horn expressions, and F . * can serve as 
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the set of characteristic models for / (Khardon & Roth, 1994). The generalized versions of 
CCM and SID, when restricted to hold all prime implicates are still equivalent to HTR. 

4.3 Enumerating Prime Implicants 

As mentioned above, given a DNF representation for / we can easily compute the set of 
characteristic models. One might therefore try to solve CCM by first translating the Horn 
expression into a DNF expression and then computing the characteristic models from this 
set. Another possible relaxation is to first compute all the prime implicants of the function 
and then to extract a DNF representation from it. We consider this problem here. Namely, 
we consider the problem of enumerating all the prime implicants of a Horn expression, and 
its application for the solution of CCM. 

While we have not found a general reduction from this problem to HTR, a simple 
adaption of the algorithm for HTR (Fredman & Khachiyan, 1994) yields an incremental 
^0(log n) algorithm for this problem. However, as we discuss below, enumeration of prime 
implicants of a Horn expression is not sufficient for solving CCM. The problem in such an 
application is an exponential gap in the sizes of these representations. 

For completeness we sketch the main ideas of the enumeration algorithm here. Let H 
be a Horn expression, and let D be the DNF expression composed of the prime implicants 
enumerated so far. The algorithm finds an assignment x which satisfies H and does not 
satisfy D. Using x it is easy to find a new prime implicant of H . The algorithm to find 
X uses the following combinatorial fact (Fredman & Khachiyan, 1994): either there is a 
variable Xi that appears m H A D with high frequency, or the expression HAD has "a 
lot" of satisfying assignments. In the first case, one can recursively solve two sub-problems 
arrived at by substituting Xi = 0, and Xi = 1 in the expressions H and D. In the second case 
it is easy to find an assignment x (e.g. by sampling). The solution of the recursion yields 
the stated time bound. For complete details we refer the reader to the article by Fredman 
and Khachiyan (1994). While the analysis there is specialized for monotone functions it is 
easy to extend (the first part of) it for Horn expressions'*. 

Lifting the Restriction: Denote by ^PIs{f) the number of prime implicants of/. While 
the representations (1) Prime Implicants (Pis), (2) DNF representation, and (3) Charac- 
teristic models, satisfy the inequalities ^PIs{f) > \DN F(f )\ > \char(f )\/n, each of the 
inequalities may allow for an exponential gap. The function 

/i = (xTV x^...y x~p^ V x^) A ... A (x^_^_^-^ V X 

V ... V Xn-i V Xn) 

(Khardon & Roth, 1994) shows a gap between (2) and (3). The function 

/2 = X1X2 ...x-rayxTwyx^my-'-yx^^y^ 

(Aizenstein & Pitt, 1995) shows a gap between (1) and (2). (To observe that, notice the 
similarity between /2 and the dual of the function from the previous sub-section.) Both 
functions are Horn (for /2 by multiplying out we see that every clause for / is Horn, 

4. One caveat that we have to tackle is enumerating prime implicants after D is already equivalent to H. 
This can be done using "consensus" operations, which can generate all the prime implicants (Aizenstein 
& Pitt, 1995) 
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although its Horn expression is large) and both have a small set of characteristic models. 
These examples show that enumeration of prime implicants may be an inefficient way for 
producing the characteristic models for some functions. 

5. A Related Problem 

In this section we show that a related problem, which is a minor variant of CCM and SID, 
is co-NP-Complete. Recall the definition of EOC: 

EOC: Entailment of Closure 

Input: a Horn CNF H , a set G of assignments. 

Output: Yes if and only if H |= closure(G). 

The important difference between CMI and EOC is that the set G is not required to 
include only satisfying assignments of H . This enables the following reduction for EOC, 
while the complexity of CMI is still open. A similar result in the database domain has been 
obtained by Gottlob and Libkin (1990). 

Theorem 12 The decision problem EOC is co-NP-Complete. 

Proof: The problem is trivially in co-NP (guess an assignment x and say "No" if a; G 
H \ closure(G)). 

To show its hardness we reduce co-Monotone 3-SAT to EOC. Monotone 3-SAT (Garey 
& Johnson, 1979) is the problem of satisfiability of CNF formulas in which in every clause 
(has 3 literals and) either all the literals are positive (we call these clauses monotone) 
or all the literals are negated (we call such clauses anti-monotone). Let / = M A A an 
instance of Monotone 3-SAT where M denotes a conjunction of monotone clauses and A 
is a conjunction of anti-monotone clauses. We translate it to the instance of EOC: H = A 
and r = U6gBij'^^'i6(Af). First we claim that the reduction is polynomial. Note that 
since M is a monotone CNF, M is a DNF formula in which all the variables are negated, 
and can therefore be written as an anti-monotone CNF formula. This implies that M is 
Horn, but we have it in a DNF representation. Further computing F is easy given the DNF 
representation of M, and its size is bounded by (ra -|- I) times the number of clauses in M . 

We now claim that / is satisfiable if and only \i H ^ closure(T). Assume first that / 
is satisfiable, and let x £ A A M . This implies that x £ H and x ^ M . Since M is Horn, 
and the models of Horn functions are closed under intersection (Theorem I) we get that 
X ^ closure(M), and since T C M x ^ closure(T). Therefore, H ^ closure(T). 

For the other direction assume H ^ closure(T), and let x be an assignment such that 
X £ H and x ^ closure(T). We get that x £ A, and since by Theorem I and Theorem 2 
M = closure(T) we have x ^ M . So, x £ A A M and / is satisfiable. ■ 

To exemplify the above reduction consider the function 

f = (a\/b\/ c){b Vc V d){a V c V d){a V 5 V c). 

This function will be translated into i7 = (aW bWc)(bWcW d). The function M = ac dWa be. 
The satisfying assignments of M are 0000,0001,0100, and F = char{M) = {0001,0100}. 
Now consider the assignment x = 1000 which satisfies /. Clearly, x satisfies H , and one 
can check that it is not in the closure of F. 
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6. Conclusions 

Horn expressions and characteristic models are two alternative representations for the same 
information and none of the two dominates the other in the computational services it can 
support. The same representations occur in database theory where they have a role in the 
design of relational databases. A natural question is whether we can translate back and 
forth between these representations so as to enjoy the benefits of both worlds. In this paper 
we have studied the computational complexity of these problems. 

Our main result is that the two translation problems CCM, and SID, are equivalent to 
each other (under polynomial reductions), and that they are equivalent to the corresponding 
decision problem CMI. Namely, translating in either direction is equivalent to deciding 
whether a given set of models is the set of characteristic models for a given Horn expression. 

We have also shown a close relation between our problems and the hypergraph transver- 
sal problem HTR. This is a translation problem which is related to many applications in 
computer science and in particular to AI. We have shown that in general CCM, and SID 
are at least as hard as HTR, and that in a special case CCM, SID, and HTR are equivalent. 

We exhibited examples which show that simple algorithms for enumerating prime im- 
plicants cannot guarantee efficient solution for CCM, and similarly enumerating prime im- 
plicates may not be efficient for SID. Lastly, we discussed the problem EOC, a minor 
modification of CMI, which is co-NP-Complete. The complexity hierarchy of the problems 
discussed is depicted in Figure 1. 

Some of the results presented in this paper can be obtained from previous results in 
database theory, using the equivalence between Armstrong relations and characteristic mod- 
els reported in a companion paper (Khardon et al., 1995). However, our proofs and expo- 
sition make these results much more accessible. 

The exact complexity of CMI, and that of HTR are left as open problems. While HTR 
has a sub-exponential algorithm, the problems CMI might still be co-NP-Hard. 
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